This paper presents a comprehensive corpus for the study of socially unacceptable language in Dutch. The corpus extends and revise an existing resource with more data and introduces a new annotation dimension for offensive language, making it a unique resource in the Dutch language panorama. Each language phenomenon (abusive and offensive language) in the corpus has been annotated with a multilayer annotation scheme modelling the explicitness and the target(s) of the abuse/offence in the message. We have conducted a new set of experiments with different classification algorithms on all annotation dimensions. Monolingual Pre-Trained Language Models prove as the best systems, obtaining a macro-average F1 of 0.828 for binary classification of ...
Cyberbullying is a serious problem that affects many young people, and social networking sites are a...
We report on our participation in GermEval Task 2018 – Shared Task on the Identification of Offensiv...
In recent years the automatic detection of abusive language, offensive language and hate speech in s...
This paper presents a comprehensive corpus for the study of socially unacceptable language in Dutch....
Annotation of tweets in Dutch for the development of tools for the automatic annotation of abusive a...
As socially unacceptable language become pervasive in social media platforms, the need for automatic...
Abusive language detection is an unsolved and challenging problem for the NLP community. Recent lite...
Abusive language detection is an unsolved and challenging problem for the NLP community. Recent lite...
After the successful completion of the Spoken Dutch Corpus (1998 – 2003) the time is ripe to take so...
Creating datasets for language phenomena to fill gaps in the language resource panorama of specific ...
The uploaded presentation was shown at the ADDA3 conference held in St. Petersburg, Florida on 13-15...
In this paper the ANNO Project ("Een Geannoteerde Publieke Gegevensbank voor het Geschreven Ned...
Purpose: Offensive discourse refers to the presence of explicit or implicit verbal attacks towards ...
Cyberbullying is a serious problem that affects many young people, and social networking sites are a...
We report on our participation in GermEval Task 2018 – Shared Task on the Identification of Offensiv...
In recent years the automatic detection of abusive language, offensive language and hate speech in s...
This paper presents a comprehensive corpus for the study of socially unacceptable language in Dutch....
Annotation of tweets in Dutch for the development of tools for the automatic annotation of abusive a...
As socially unacceptable language become pervasive in social media platforms, the need for automatic...
Abusive language detection is an unsolved and challenging problem for the NLP community. Recent lite...
Abusive language detection is an unsolved and challenging problem for the NLP community. Recent lite...
After the successful completion of the Spoken Dutch Corpus (1998 – 2003) the time is ripe to take so...
Creating datasets for language phenomena to fill gaps in the language resource panorama of specific ...
The uploaded presentation was shown at the ADDA3 conference held in St. Petersburg, Florida on 13-15...
In this paper the ANNO Project ("Een Geannoteerde Publieke Gegevensbank voor het Geschreven Ned...
Purpose: Offensive discourse refers to the presence of explicit or implicit verbal attacks towards ...
Cyberbullying is a serious problem that affects many young people, and social networking sites are a...
We report on our participation in GermEval Task 2018 – Shared Task on the Identification of Offensiv...
In recent years the automatic detection of abusive language, offensive language and hate speech in s...